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METHOD OF NORMALIZING SOFTWARE 
USAGE DATA FROM MAINFRAME COMPUTERS 

RELATED APPLICATION 

This Application claims priority and is entitled to 
5 the filing date of U.S. Provisional Application Serial 

No. 60/188,380 filed March 10, 2000, and entitled 
"METHOD OF NORMALIZING SOFTWARE USAGE DATA FROM 
MAINFRAME COMPUTERS, " the contents of the provisional 
patent application are incorporated by reference herein. 

10 

BACKGROUND OF THE INVENTION 

In the area of computing, the performance of 
software is highly dependent upon the performance (i.e., 
speed) of the hardware. Common benchmarks for hardware 

15 performance are usually stated in terms of drystones, 

whetstones, Millions of Instructions Per Second (MIPS) , 
Million Service Units (MSU) , etc. The values derived are 
highly influenced by various factors including the 
processor, memory size and speed, cache memory, hardware 

20 peripherals, bus speed, operating system, etc. 

Thus, for a given computer system, the typical 
method for comparing the usage of one or more software 
products is usually established relative to that 
configuration. Hence, a usage factor obtained for a 

25 software product on one computer system, all other 

conditions being equal, may be dramatically different 
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when run on a different computer system. For example , a 
product taking 100 CPU-seconds on a 3 00 MIPS processor 
may use only 75 on a 4 00 MIPS system for the very same 
processing task. 
5 XSLM- compliant licensing systems collect and record 

data about the usage of the licensed products and 
relevant events related to license management. Other 
products, such as SoftAudit from Isogon and FlexLM from 
Globetrotter, collect usage data, provide usage 
10 statistics, and produce reports. None of these systems 

and products provides a means for the user to compare 
usage in a dynamically changing environment. 

LicensePower/MVS from Isogon provides the user with 
the ability to "scale" usage statistics however, such 
15 measurements are for static configurations. The user 

must manually select the time intervals of choice (hour, 
day, week, or month) and the appropriate scale factors 
that are to be applied for each computer system. 

For the most part, products that collect and report 
20 usage statistics do so for static configurations and 

other products that report on environmental changes (of 
the computing system) do so independently of one 
another. This is illustrated in prior art Figure 1. 

System 10 is the usage data auditing and collection 
25 system, Isogon' s SoftAudit product being an example 

thereof. The system 10 is juxtaposed to the system 30 
in Figure 1 which is used for measuring the capacity of 
a computer over time. The typical software usage 
monitoring system includes a first facility 12 which 
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collects software usage data and stores it in a usage 
log 14. The block 16 extracts usage data and stores 
software product usage in a log 18. The block 2 0 
generates various reports on software product usage. 
5 In the system 30, the first software block 32 waits 

for changes in capacity to occur. As changes occur, 
they are detected and indications thereof are noted as 
"events" which are stored at step 34 in event log 36. A 
capacity report is then produced by the capacity report 

10 generator 38, which defines how and when the capacity 

(i.e., performance characteristics of the computer) has 
changed over selected time periods. In the prior art, 
the outputs and functionalities of the systems 10 and 3 0 
have not been interfaced or correlated with one another. 

15 Thus, in an environment where computer systems are 

partitioned (i.e., S/390 LPARs or contain multiple 
processors) and the capacity and number of the different 
partitions within these system can be dynamically 
changed as processing needs change, the measurement and 

20 comparison of software usage and licensing fees (which 

are often based on the computing capacity of the 
computer on which the software will run) may become 
skewed as these changes in computing capacities occur. 

SUMMARY OF THE INVENTION 
25 Accordingly, it is an object of the present 

invention to provide a means of extracting data 
regarding the change in computing capacity from various 
information logs; to generate output records of this 
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data; to provide this data to other programs; and to 
perform various statistical and normalization 
calculations and report the results. 

It is another object of the present invention to 
5 combine computing capacity data with software usage data 

produced by other programs; to generate output records 
of this combined data; to perform calculations that 
normalize the usage data; to provide this data to other 
programs; and to perform various statistical and 

10 normalization calculations and report the results. 

It is yet another object of the present invention 
to generate output data records in a format that is 
compatible with the reporting programs of the products 
that produced the original software usage data so that 

15 they may be used to produce reports using normalized 

usage data. 

The aforementioned and other objects of the 
invention are realized by an aggregation of software 
programs which carry out a variety of tasks that obtain 

20 results that are usable both independently and in 

combination. Thus, the present invention employs a 
first software program which runs substantially 
continuously on a computer and which monitors and 
records data that provides a measure of the capacity of 

25 the computer over specified time periods. This 

information is useful by itself or as input to other 
programs that perform various statistical and 
normalization calculations on the results. 

In a further developed construction of the 
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invention, the results obtained by the first software 
program are provided to a software program usage monitor 
that gathers information about the usage of software 
products on the computer, the results of both programs 
5 being combined and normalized to restate software 

program usage data in a manner that reflects changes in 
computer capacity over time. As an option, the restated 
usage data is cast in a form and format that is 
compatible with the existing format of the software 
10 program usage reports. 

Other features and advantages of the present 
invention will become apparent from the following 
description of the invention which refers to the 
accompanying drawings . 

15 BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 is a block diagram of software product 
usage monitoring and computer capacity tracking 
programs . 

Figure 2 is a flow chart illustrating the 
20 normalization of computer product usage data relative to 

computer capacity event data. 

DETAILED DESCRIPTION OF EMBODIMENTS OF THE INVENTION 
Hereinafter, the term "computing index" or "CI" 
means or represents a measure of the processing power of 
25 a computer or CPU. Typical computing indices are a 

combination of one or more of MIPS, MSUs, CPU speed, 
number of processors, drystones, whetstones, Model Group 
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or other such indices. The description below refers at 
times to software elements that are identified by the 
numerals appearing in Figures 1 and 2. 

The licensing fees charged by vendors for software 
5 used by large data centers is most often based upon the 

size of the CPU that the software is run on. Users are 
either charged a fixed amount according to which of 
their CPUs fall into a particular class (more commonly 
known as a "model group") or, they are charged according 

10 to the speed (MIPS rating) of a specific CPU. There is 

no common basis or standard definition of a model group. 
Each vendor may establish his own model group pricing 
schedules separate and apart from what other vendors may 
use. For example, ABC Corp. may list seven processor 

15 groups for software product covering all HAL processors 

and another company may define twenty groups for those 
same processors . 

The present invention consists of a number of 
components that can be assembled in building-block 

20 fashion such as a single program containing the 

functionality of one or more of the components; as 
separate programs that are independently executed; as a 
program that executes as the result of an API 
(Application Program Interface) call from one of the 

25 other components; as an exit routine from another 

component; or as combinations of the above. 

Knowledge Base (KB) : The KB 42 is a list or 
database that correlates various computing indices 
according to any of CPU, CPU to Manufacturer, Vendor to 
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Vendor's Model Group, etc. For example, if an 
information log lists the CPU as being a HAL-1000, the 
KB entry for that CPU will contain the appropriate CI 
for that model and other relevant information. Table 1 
5 is a sample of what information might be contained in 

the KB. 

For example, as demonstrated by the presence of 
values for MIPS, et al . or other means (not shown), the 
HAL-1000 CPU is manufactured by the HAL Computer Corp. 
10 and is designated as belonging to their Model Group A. 

Two other vendors, ABC Corp. and XYZ Software designate 
the same CPU as belonging to their Model Groups C and D, 
respectively. 

Access to information in the KB 42 database can be 
15 made directly or, for example, through an API call to a 

process that first extracts and then returns the 
appropriate information. In the latter case, various 
types of API calls can be made that if supplied, for 
example, with the CPU model number, return the CI in 
20 MIPS; the model group applicable to the supplied CPU; 

etc . 

Table 1 - Sample Knowledge Base 



CPU MIPS MSU LPAR Manufacturer Model 

Group 

HAL-1000 150 175 1 HAL Computer A 

Corp. 

25 HAL-1000 ABC Corp. C 

HAL-1000 XYZ Software D 

HAL-2000 210 260 1 HAL Computer C 



Corp. 



00488539.1 




-8- 



HAL-2000 
HAL-2000 
HAL-9000 



20000 35000 



64 



ABC Corp. 
XYZ Software 
HAL Computer 



E 
H 
S 



Corp . 



10 



15 



20 



25 



00488539.1 



Capacity Data Extractor (CE) : The CE 30 (Figure 1) 
is a facility which extracts information regarding 
changes in computing capacity that has been separately 
gathered and recorded in information logs by a 
monitoring program, the operating system, Technical 
License Managers (TLMs) and other programs as 
appropriate. The information logs may contain specific 
fields for capacity information, a sequential stream of 
text messages, or other known formats. Using heuristics 
and other techniques, the CE 3 0 interprets these 
messages and fields to extract the appropriate 
information. The CE 30 will, according to user- specif ied 
parameters : 

• apply a filter to the capacity information data; 

• returns a CI or other such capacity information 
that corresponds to the earliest extracted event, 
e.g., the MIPS value that was in effect for the 
very first extracted event; 

• optionally uses the knowledge base 42 to lookup and 
substitute the appropriate CI for the CPU model or 
other identifying data extracted from the 
information logs; 

• performs user-specified calculations and outputs 
data records of those calculations; 

• outputs data records of (raw) computing capacity 
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event data 

Furthermore, the user can provide extraction 
(filter) specifications such as: 

• a particular computer system, CPU or LPAR; 
5 • a particular location or enterprise; 

• a period of time 
Optionally, the CE 30: 

• extracts and returns or stores data in response to 
an API call from another process 

10 • extracts and processes data as a exit routine from 

another process 

• stores output data in a file or database according 
to a user-specified format such as comma separated 

• variables (CSV) , tab separated variables (TSV) , 
15 plain text, XML, etc. 

• accesses the information logs of one or more 
computer systems from a remote location using a 
communication network or dial-up access. 

• accesses the information logs from one or more 

20 remote computer systems which have been downloaded 

to the computer system upon which the CE 3 0 
executes . 

• sends extracted data to another computing facility, 
for example, a central clearinghouse of such data. 

25 Minimally, each CE output data record contains the 

timestamp (date and time or at least date) of the event 
and the new computing index. Optionally, other relevant 
information that is output includes the identity of the 
computer system, processor, LPAR, location, software 
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product, etc. If this information is not contained 
within the information log, the CE 3 0 provides this 
using data from other system configuration records or a 
user-provided configuration list. For example, a minimal 
5 record contains only two fields: TIMESTAMP and CI. A 

more detailed record contains fields such as TIMESTAMP, 
CI, PRODUCTNAME, USAGE, LPAR, and LOCATION. 

When the CE 3 0 encounters an event in the capacity 
information log that is not an actual CI, such as CPU 

10 model, it substitutes the appropriate computing index to 

the output record by using the knowledge base (KB) 42 
which also correlates various computing indices to CPU, 
CPUs to model groups, etc. For example, if the 
information log (see Table 2) reflects that the HAL-1000 

15 has been upgraded to the HAL-2 0 00, the CE 3 0 looks up in 

the KB 42 the MIPS computing index of the HAL-2000 which 
is 210 MIPS and outputs that as an event record in the 
event log 36. 

Table 2 - Sample Capacity Information Log for a Multi-computer Installation 

20 Timestamp System LPAR Info 

1/1/99 Groucho 1 HAL-1000 Installed 

00 :15 

6/15/99 Groucho 1 HAL-2000 Upgrade Completed 

15 : 11 

25 6/17/99 Atlas 1 IBM 3090/500J De-installed 

12 :01 

6/20/99 Groucho 1 MIPS increased to 225 

00 :53 



00488539.1 




# 



- 11 - 



6/22/99 



Harpo 



1 



HAL-1000 Installed 



11:11 
6/25/99 



Groucho 



1 



MIPS increased to 250 



00:00 
7/1/99 



Groucho 



0 



HAL-9000 Installed 



16:12 



10 



15 



20 



25 
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For example, the information stored in a system 
configuration log (Table 2) may contain information 
regarding the addition or removal of processors or the 
change in processing capacity of one or more computer 
systems, one or more logical partitions within one or 
more computers, or a network or sysplex of computers. 
The user may desire to output the raw capacity event 
data; or produce certain capacity statistics such as 
average CI, high watermark CI, number of CPUs, etc. each 
filtered according to user-specified parameters. Such 
statistics may be given over a user-selected period of 
time such as, for example, average daily value for each 
day, monthly high watermark value, average high 
watermark for each month in the time period, or second- 
highest average daily value over a period of days. 

By way of example, Table 3 is a sample of the CE 
output data records produced from the information logs 
in Table 2 for the ABC Corp. on the "Groucho" system for 
the month of June, 1999. Note that the CE has performed 
the appropriate substitutions from the KB - provided a 
first record reflecting the CI in effect at the 
beginning of the time period, provided the CI in effect 
for each event, and inserted the appropriate "ABC model 
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group" for each output event. In the latter case, note 
that the model group information and timestamps are now 
available to the user to evaluate any change in 
licensing costs for software from ABC Corp. 

5 Table 3 - Sample CE Output Records for ABC Corp. on Groucho 6/1/99 

- 6/30/99 



Times tamp CI Model 

(MIPS) Group 

6/1/99 00:00 150 C 

6/15/99 15:11 210 E 

10 6/20/99 00:53 225 E 

6/25/99 00:00 250 E 



Usage Data Extractor (UE) : The UE 10 (Figure 1) is 
a facility which extracts information regarding software 
product usage that has been separately gathered and 

15 recorded by a monitoring program, the operating system, 

Technical License Managers (TLMs) or other programs as 
appropriate (e.g., SoftAudit, FlexLM, etc.). A 
description of a usage data extractor and reporter 
appears in the present assignee's U.S. patent number 

20 5,590,056, the contents of which are incorporated by 

reference herein. 

The UE 10, under user-control, extracts usage data 
from one or more independent information logs; 
optionally: 

25 • applying a filter to the usage data; 

• combining, as appropriate, the usage data from each 
of the information logs; and/or 
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• generating output records of the raw combined data 
The user can provide extraction specifications 

(filters) such as: 

• a particular computer system, CPU or LPAR; 
5 • a particular location or enterprise; 

• a particular software product and/or version; or 
all software products; 

• a set of software products optionally, of a 
specific version; 

10 • licensed software products; 

• products by vendor; 

• a user or group of users; and/or 

• a period of time 
Optionally, the UE 10: 

15 • extracts and returns or stores data in response to 

an API call from another process; 

• extracts and processes data as an exit routine from 
another process; 

• stores output data in a file or database according 
20 to a user-specified format such as comma separated 

variables (CSV) , tab separated variables, plain 
text, XML, etc . ; 

• accesses the usage information logs of one or more 
computer systems from a remote location using a 

25 communication network or dial-up access; 

• accesses the usage information logs from one or 
more remote computer systems which have been 
downloaded to the computer system upon which the UE 
10 executes ; and/or 
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• sends extracted data to another computing facility, 
e.g., a central clearinghouse of such data. 
Carrying the preceding sample further, the UE 10 is 
tasked with extracting all usage information for the 
5 Groucho system; for the month of June, 1999; on a daily 

basis; for all software products from ABC Corp. A sample 
of the results is shown in Table 4. 



Table 4 - Sample UE Output 



Timestamp 


Product 


Users 


Jobs 


CPU 










seconds 


6/1/99 


ABCview 


2 


14 


43 . 3 


6/1/99 


ABCaudit 


1 


2 


3219 . 1 


6/2/99 


ABCaudit 


1 


1 


21 


6/16/99 


ABCview 


4 


23 


18 


6/29/99 


ABCaudit 


1 


2 


1421 


6/29/99 


ABCview 


3 


27 


21 



Data- Combiner (DC) : The DC 55 is a facility which 
first merges capacity event data extracted by the CE 30 
with software product usage data that has been extracted 
by the UE 10 and then performs various calculations 
20 (such as normalization) and outputs the data according 

to user-specifications. 

The DC 55 operates on the two types of data by: 

• combining usage data with computing capacity event 
data, optionally, generating output records of the 

25 raw combined data; 

• optionally, sorting, correlating, filtering and 
performing various user-specified calculations and 
generating output records of those calculations; 
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• optionally, generating output records in the very 
same format as the software usage reporting 
program (s) that originally created the usage data 
with the appropriate usage fields having been 

5 replaced by normalized numbers 

Optionally, the DC: 

• storing output data in a file or database according 
to a user-specified format such as CSV, TSV, plain 
text, XML, etc. 

10 • sending output data to another computing facility, 

e.g., a central clearinghouse of such data. 
As already noted, an optional facility of the DC 55 
is to generate output records in a format that is 
compatible with various software usage reporting 

15 programs (such as SoftAudit's REPORTER) that originally 

created the usage data with the appropriate usage fields 
having been replaced by normalized numbers. Thus, the 
normalized usage log can then be used by whatever 
processes would otherwise have been used by the original 

20 usage log. Programs such as REPORTER generate reports by 

reading usage data from files that have been prepared in 
a specified format, typically by another program from 
the same vendor. Under user-control, the DC 55 generates 
output records in the very same format with the 

25 appropriate usage fields having been replaced by 

normalized numbers. Using the DC generated normalized 
usage records as input, the reporting program generates 
reports that reflect normalized statistics. For example, 
if the CPU-seconds field is replaced by normalized 
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values; the output report presents a uniform measure of 
software usage. 

Normalization: Various methods are available to the 
DC 55 and UR 20 for normalizing usage data using a 
5 computing index such as processor speed (e.g., MIPS, 

total MIPS, MSUs, etc.), number of logical partitions 
(LPARs) , LPAR capacity (MIPS, memory size, etc.), number 
of processors, or other physical characteristics and 
configuration settings . 

10 For example, if a baseline of 150 MIPS is 

established for performance and job accounting analysis 
then, for any processor or LPAR, the normalized number 
(XCS) of CPU-seconds used by a job is calculated 
according to the formula: 

15 XCS = CPU-seconds x MIPS / 150 

Hence, if a job is run in an LPAR having a 150 MIPS 
capacity one day and, 200 MIPS capacity the next, the 
normalized usage (XCS) will provide the user with a 
consistent measure of resource usage. 

20 Other methods of normalization and processing CI 

and usage data over a period of time include running 
averages, high watermark usage, user-MIPS (product of 
current number of users and MIPS), etc. 

Optionally, the user may specify a formula to be 

25 used in the normalization and output of usage data. The 

formula can specify how the data in certain DC fields 
are to be used; various scaling factors such as 
cost/cpu-second; and how to normalize data for specific 
instances, e.g., according to a specific LPAR or 
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LOCATION. 

Carrying the preceding example further, the DC 
combines the CE and UE output data, Tables 3 and 4, 
respectively, and applies the above formula to normalize 
the CPU- seconds data. The output of the DC (Table 5) is 
in a format compatible with SoftAudit's REPORTER 
program. Note that the normalized data is reflected in 
the last three entries. 

Table 5 - Normalized Output Usage Data 

Timestamp Product Users Jobs Normalize 



d CPU 
seconds 



6/1/99 


ABCview 


2 


14 


43 .3 


6/1/99 


ABCaudit 


1 


2 


3219 . 1 


6/2/99 


ABCaudit 


1 


1 


21 


6/16/99 


ABCview 


4 


23 


25.2 


6/29/99 


ABCaudit 


1 


2 


2368 .3 


6/29/99 


ABCview 


3 


27 


35 



Usage Reporter (UR) : The UR 2 0 is a process which 
uses DC output data to sort, correlate, consolidate, 
summarize, format and output reports that have 
normalized usage statistics based upon user-specified 
parameters and formulae. As the data is read, the UR 20 
computes the appropriate usage statistics applying the 
current capacity index factors. If an event denotes a 
change in a capacity factor, the UR 2 0 may note that in 
the output report and then apply the new capacity 
factors in its calculations. 

For example, the user can specify that the UR 2 0 
generate a report based upon a user- specif ied Model 
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Group such as a CI in the range of 0-100 MIPS is Model 
Group A, 100-150 MIPS is Model Group B, etc. 
Accordingly, minor changes in CI are reported in favor 
of cumulative changes in capacity that cross from one 
Model Group factor to another. _ 



For completeness and summary, Figure 1 illustrates 
the usage report 10 and its constituent components 
including ]a section 12 that collects software usage data 
and stores the raw information in a usage log 14 . The 
component* 16 extracts some of the data, stores it in a 
software/usage log 18 and the reporter 2 0 generates the 
standai^W^d reports, as in the prior art exemplified by 
the 5 ' l] ^Q^ l ^S^vatenp n . i . 



The capacity extractor 3 0 includes the component 32 
15 which waits for a change in capacity to occur and stores 

at component 34 the event information in an information 
log and then stores events in an event log 36, using the 
generator 38 to generate capacity reports. 

The functions of the elements 10 and 3 0 in Figure 1 
20 are combined in Figure 2 in a system according to the 

invention which uses a modified usage extractor 44 that 
extracts usage data from the usage log 14 as indicated 
by component 46 and applies various filtering and 
selection criteria as indicated by component 48. 
25 . Simultaneously, the capacity extractor 50 accesses 

the event log 3 6 and the knowledge base 42 which 
contains various correlation data to obtain or extract 
capacity event data as indicated by element 52 . The 
filter 54 reduces that data which is then combined with 
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data obtained from the usage extractor 44 in the data 
combiner 55 which includes the component 56 which 
combines and normalizes data. The component 58 then 
applies further filtering and the outputer 60 outputs 
5 normalized data records to a normalized usage data log 

64 as well as to a software usage log 14a. The element 
62 can generate separate normalized data reports^. 
However, the same information can be obtained from the 
software usage log 14a and used by a standard reporter 

10 program 20a which generates reports from a conventional 

usage extractor program 10, such as Isogon's SoftAudit 
product, which is- illustrated in Figure 1. 

Although the present invention has been described 
in relation to particular embodiments thereof, many 

15 other variations and modifications and other uses will 

become apparent to those skilled in the art. It is 
preferred, therefore, that the present invention be 
limited not by the specific disclosure herein, but only 
by the appended claims . 
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